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[57] ABSTRACT 

In a program data stack in a computer system, every stack 
is implemented as two substacks, one to contain references 
and one to contain primitive data. In this manner, whether a 
piece of information on a stack is a reference or a primitive 
value is easily determined according to which substack it 
resides in. Each substack is itself a full-fledged stack data 
structure with a stack base, a stack pointer, and a stack limit. 
In the preferred embodiment, there is also a frame pointer 
for each substack. The normal instruction set is modified so 
that instructions that operate with references use the refer- 
ence stack whereas instructions that operate with data use 
the primitive data stack. Specifically, during compilation, or 
loading, all instructions which can operate indiscriminately 
with either references or data are replaced by equivalent, but 
reference-specific or data-specific instructions. With this 
arrangement, garbage collection algorithms can locate all 
references stored within a stack, since the references are all 
in a more or less contiguous region of memory without 
interspersed primitive data. 

22 Claims, 8 Drawing Sheets 
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SYSTEM AND METHOD FOR ASSISTING access objects. Such objects can themselves contain refer- 

EXACT GARBAGE COLLECTION BY ences to data, such as integers or floating-point numbers and 

SEGREGATING THE CONTENTS OF A to yet other objects. In this manner, a chain of references can 

STACK INTO SUB STACKS be created, each reference pointing to an object which, in 

5 turn, points to another object. 

CROSS REFERENCES TO RELATED Garbage collection techniques determine when a data 

APPLICATIONS structure is no longer reachable by an executing program, 

™. i- • *u f *u * * r ^ either directly or through a chain of pointers. When a data 

This application is the one of three patent applications . 3 , . . , . ^ 4 

c, . j*u *u a ™ i «,««.«.• ~a structure is no longer reachable, the memory that the data 

filed on an even date herewith and commonly assigned, , n . & , , . ' \ . , 

including pending U.S. patent application Ser. No. 08/838, 10 stmc * re ocoipies can be reclaimed and reused even if it has 

971, entitled "APPARATUS AND METHOD FOR ASSIST- n ° l ^ en ex P hcitl y 1 f e ^ Dca ^ sd li by the J? ^ 

ING EXACT GARBAGE COLLECTION BY USING A f cch ™> collecti0n techniques should be able to, 

STACK CACHE OF TAG BITS", and pending U.S. patent first > ldentlf y reference * that ™ direcll y accessible to the 

application Ser. No. 08/838,971 entitled "METHOD AND „ executing program and, second, given a reference to an 

APPARATUS FOR DELTA ENCODING LIVE POINTER object ' ldentlt y references contained within that object, 

MASKS FOR PROGRAM DATA STACKS". The subject lhereb y f°J in Z the garbage C ° UeCt ° r t0 traCC transitivel y 

matter of the above-identified copending patent applications chams of references - 

is hereby incorporated by reference. A subclass of garbage collectors known as "relocating" 

garbage collectors, relocate data structures that are still 

FIELD OF THE INVENTION 20 reachable by the executing program. Relocation of a data 

structure is accomplished by making a copy of the data 

Hie present invention relates to data processing systems, structure in anoth er region of memory, then replacing all 

and, more specifically, to apparatus and methods for assist- reac hable references to the original data structure with 

ing with memory management. references to the new copy. The memory occupied by the 

nAr>i/pnAirun tup iM^/nvmnM 25 original data structure may then be reclaimed and reused. 

BACKGROUND OF THE INVENTION „ f . , n * , tU , . U1 

Relocating garbage collectors have the desirable property 

One of the most important resources within a data pro- that they compact the memory used by the executing pro- 
cessing system is the amount of memory directly available gram and thereby reduce memory fragmentation, 
for utilization by tasks during execution. Accordingly, much 3q Because relocating garbage collectors modify references 
interest has been directed to efficient utilization of memory during the garbage collection process, it is important that 
and memory management strategies. An important concept references be identified and distinguished from non- 
in memory management is the manner in which memory is reference information, such as data, which cannot be modi- 
allocated to a task, deallocated and then reclaimed. fied for garbage collection purposes. Consequently, fully 

Memory deallocation and reclamation may be explicit and 35 relocating garbage collectors belong to a subclass of garbage 

controlled by an executing program, or may be carried out collection methods, known as "exact" garbage collectors, 

by another special purpose program which locates and which require knowledge whether a given piece of infor- 

reclaims memory which is unused, but has not been explic- mation in memory is a reference or a primitive value. For the 

itly deallocated. "Garbage collection" is the term used in purposes of this document, a "primitive value" or "primitive 

technical literature and the relevant arts to refer to a class of 40 data" is defined as data which does not function as a 

algorithms utilized to carry out storage management, spe- reference, such as an integer or lloating point number, 

cifically automatic memory reclamation. There are many i n order to facilitate the use of exact garbage collection, 

known garbage collection algorithms, including reference some computing systems use a "tagged" representation for 

counting, mark-sweep, and generational garbage collection all memory locations to positively distinguish references 

algorithms. These, and other garbage collection techniques, 45 from data. In such systems, references and primitive data, 

are described in detail in a book entitled "Garbage such as integers and floating-point numbers, are represented 

Collection, Algorithms For Automatic Dynamic Memory in memory in a manner that a reference always has a 

Management" by Richard Jones and Raphael Lins, John different bit pattern than a primitive value. This is generally 

Wiley & Sons, 1996. Unfortunately, many of the described done by including tag bits in each memory location in 

techniques for garbage collection have specific requirements 50 addition to the bits holding the memory location value. The 

which cause. implementation problems, as described herein. tag bits for a memory location holding a reference value are 

For the purpose of this specification, the term "object" always different from the tag bits for a memory location 

refers to a data structure that is represented in the memory holding a datum value. The MIT LISP Machine was one of 

of a computing system. This usage of the term object is the first architectures which used garbage collection and had 

distinct from the usage of the term "object" in "object- 55 a single stack with explicitly tagged memory values. Its 

oriented" systems, wherein objects have associated successor, the Symbolics 3600, commercially available from 

"methods", i.e. pieces of code associated with them, which Symbolics, Inc., Cambridge, Mass., also used explicitly 

code may be invoked through a reference to the object. tagged memory values. The Symbolics 3600 was able to 

However, the present invention is applicable to such object- accommodate either a 32-bit reference or a 32-bit primitive 

oriented systems. 60 datum in a single stack by using 36 bit words, 4 bits of which 

An object may be located by a "reference", or a small werc permanently allocated for tagging information. As 

amount of information that can be used to access the data such, the bit pattern within a 36-bit word for a reference was 

structure. One way to implement a reference is by means of always distinguishable from the bit pattern for a primitive 

a "pointer" or "machine address", which uses multiple bits integer or floating-point value. 

of information, however, other implementations are pos- 65 Permanently allocated tag bits have the disadvantage that 

sible. General-purpose programming languages and other they consume memory space that might otherwise be used to 

programmed systems often use references to locate and store computational data. Consequently, many computer 
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systems use an "untagged" data representation in which the does not accommodate tagged data or which does not 

entire memory word is devoted to representing the datum support segregation of operands into different stacks pre- 

value. In such systems, the same bit pattern might represent sents implementation problems, 
a reference or a primitive value. As a result, in such systems, 

the distinction between references and primitive values is 5 SUMMARY OF THE INVENTION 
often made from external considerations or representations, 

such as the instruction that is to operate on the data, or the A central fealure of thls invention is to implement every 
position of the data within an object. However, the use of stack as two substacks, one to contain references and one to 
external considerations to make this distinction was not contain primitive data. In this manner, whether a piece of 
possible in all systems. information on a stack is a reference or a primitive value is 
For example, the Java programming language was origi- easily determined according to which substack it resides in. 
nally designed for use in systems using untagged data Each substack is itself a full-fledged stack data structure 
representation. The Java programming language is described with a stack base, a stack pointer, and a stack limit. In the 
in detail in the text entitled "The Java Language Specifica- preferred embodiment, there is also a frame pointer for each 
tion" by James Gosling, Bill Joy and Guy Steele, Addison- substack. The normal instruction set is modified so that all 
Wesley, 1996. The Java language was designed to run on 15 instructions that operate with references use the reference 
computing systems with characteristics that are specified by stack whereas all instructions that operate with data use the 
the Java Virtual Machine Specification which is described in primitive data stack. In accordance with one embodiment, 
detail in a text entitled "The Java Virtual Machine during compilation, or loading, all instructions which can 
Specification", by Tim Lindholm and Frank Yellin, Addison- operate inc i isc riminately with either references or data are 
Wesley, 1996. 20 re pi ace d by equivalent, but reference-specific or data- 
According to the Java Virtual Machine (JVM) specific instructions. 
Specification, a local variable or stack slot in a computing _ . , , t „ . , . L 
system using 32-bit memory words may contain either a , ™ e mven } 10n enables f rba § e collection algorithms to 
32-bit integer, a 32-bit floating-point number, or a 32-bit locatc a11 references stored within a stack, since the refer- 
reference. Consequently, tagged data representation cannot 25 ences are a11 10 a more or less contiguous region of memory 
be used in all cases (programming languages that use tagged without interspersed primitive data. According to one aspect 
data representation on 32-bit computer architectures typi- of the invention, the two substacks representing a single 
cally restrict the size of integers to 30 bits.) However, it is stack occupy the same linear, contiguous region of memory 
not possible to distinguish references from data in all cases that otherwise would have been dedicated to the single stack, 
by examining Java instructions, because many instructions 30 

operate indiscriminately on references and data. BRIEF DESCRIPTION OF THE DRAWINGS 

Another technique by which references and primitive data ^ above and father advantages of the invention may be 

are identified and distinguished is to keep them on separate 5etter understood by referring to the following description in 

stacks. This technique was used in a system that also used conjunction with the accompanying drawings and which: 

garbage collection and goes back more than twenty years, to 35 1A . , U1 , .„ . 

B B . * . m , T • t-i__ir / 1 ™ FIG. 1A is a schematic block diagram illustrating a 

a version of the MacLisp system that had four stacks. Two . t , , e 4 • 

r , . j ; . 1 j * j i f\c *i_ computer system suitable for use with the present invention 

of these stacks contained references and two did not. Ot the r J r 

two that contained references, one contained ordinary sub- FIG * 1B * a conceptual illustration of a conventional 

routine arguments and one contained "dynamic bindings" of program data stack; 

so-called "special variables". Of the two that contained 40 1C is a conceptual diagram of a conventional pro- 
primitive data, one held integer values and one held floating- gram stack, including frames; 

point values. While it was true that the garbage collector did FIG. 2 is a conceptual diagram of a simple program stack 

not need to scan the integer and floating-point stacks while segregated into substacks in accordance with a first embodi- 

looking for references, the principal motivation for this ment of the present invention; and 

separation was not to use two or more stacks to simulate a 45 FIG. 3 is a conceptual diagram of a program stack, 

single logical stack, but rather to stack-allocate certain including frames, segregated into substacks in accordance 

numerical quantities that might otherwise have required with a second embodiment of the present invention, 

allocation in the garbage-collected heap, and to be able to piGS wheQ ^ h form a ^ chari 

tell whether such data was integer or floating-point by its mustrati the tion of the inventive stack allocation 

address. This required two separate stacks tor primitive data, 50 memoc j 
not just one. Subroutine arguments were never passed on the 

two primitive stacks. Rather, every argument was repre- DESCRIPTION OF THE PREFERRED 

sented as a reference on the main reference stack. Although EMBODIMENT 
some of these references were addresses of locations on the 

numerical stacks, this was a consequence of the more 55 The present invention provides a method and apparatus 

general mechanism for stack allocation of numerical values. which addresses the requirements of exact garbage collcc- 

Local variables of the reference type were allocated on the tion algorithms. Specifically, the invention addresses the 

reference stack and local variables of numerical type might, requirement of distinguishing references from non- 

or might not, be allocated on a numerical stack. If a local references within a data stack where data representations are 

variable of the numerical type was allocated on a numerical 60 untagged and references and primitive values have the same 

stack, it might be represented in a dual form where the size representations. 

numerical value was stored in a numerical stack and the Although the illustrative embodiments are described with 

address of the location on the numerical stack was stored in reference to a computer system implementing the Java 

the reference stack. A generalization of such a strategy was programming language and Java Virtual Machine 

used in the S-l LISP language. 65 specifications, the invention is equally applicable to other 

Accordingly, implementation of an exact garbage collec- computer systems having similar requirements. Specifically, 

tion algorithm with a computer system architecture which the invention may be implemented with both object oriented 
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and Don -object oriented programming systems, as well as steps needed in a program, while facilitating push-down 

multi threaded operating systems, having multiple stacks, as type storage. FIG. IB illustrates conceptually the structure 

well as operating systems having single threads or single of a conventional program stack residing in system memory, 

stacks. Prior to describing the invention in detail, a descrip- Three important positions in memory define the stack: the 

tion of a computer system suitable for implementing the 5 stack base, the stack pointer, i.e., the current stack top, and 

present invention, and a conventional stack architecture is lhe stack ^ ^P^ally, these positions are identified by 

provided for the reader's benefit. presses kept in three machine registers. 

Computer System and Stack Architecture . ™™ a dat ™ 1S t0 be P us * ed onto li * s ^ ed 

1 a '11 . t *u a „ . ^ f „,^ r „ ^ m „„ liir into the unused memory location nearest the stack pointer, 

illustrates the system architecture for a computer ^ f fc ^ ^ ^ Jf 

system 100 with which the invention may be implemented. io ^ ^ ^ ^ t0Q dose {Q thc s[ack ^ the stack is 

The exemplary computer system of FIG. 1 is for descriptive said tQ « overflow » and ^me special action must be taken, 

purposes only. Although the description may refer to terms si , an erfor Qr albcate more memory t0 hold the 

commonly used in describing particular computer systems, stack. 

such as in IBM PS/2 computer, the description and concepts Wnen a datum ^ t0 be poppe d from the stack, the stack 

equally apply to other computer systems such as network 15 pointer recedes toward the stack base, causing the memory 

computers, workstations and even mainframe computers, m at holds the datum to become regarded as unused memory 

having architectures dissimilar to FIG. LA. again. If the stack pointer is too close to the stack base, the 

Computer system 100 includes a central processing unit stack is said to "underflow" and some special action must be 

(CPU) 105, which may be implemented with a conventional taken, e.g. signal an error or switch to another region of 

microprocessor, a random access memory (RAM) 110 for 20 memory that holds more stack data. Depending on the 

temporary storage of information, and a read only memory implementation, the stack base may reside at either a higher 

(ROM) 115 for permanent storage of information, A or lower memory address then the stack limit, 

memory controller 120 is provided for controlling RMA In the Java programming language, data on the stack is 

110. grouped into "frames", as illustrated in FIG. 1C. Each frame 

A bus 130 interconnects the components of computer 25 corresponds to one level of subroutine invocation or method 

system 100. A bus controller 125 is provided for controlling invocation. Every frame is divided into three regions: 

bus 130. An interrupt controller 135 is used for receiving and parameters, local variables, and evaluation temporaries, 

processing various interrupt signals from the system com- Parameters are closest to the stack base while the evaluation 

ponents. temporaries are furthest from the stack base. Each of these 

Mass storage may be provided by diskette 142, CD ROM 30 three regions may be empty, depending on the particular 

147, or hard drive 152. Data and software may be exchanged subroutine to be executed with that frame. As a subroutine 

with computer system 100 via removable media such as executes, the number of evaluation temporaries may vary as 

diskette 142 and CD ROM 147. Diskette 142 is insertable items are pushed onto or popped from the stack, however, 

into diskette drive 141 which is, in turn, connected to bus 30 the number of parameters and local variables typically does 

by a controller 140. Similarly, CD ROM 147 is insertable 35 not vary. Consequently, different frames may have different 

into CD ROM drive 146 which is, in turn, connected to bus sizes. 

130 by controller 145. Hard disk 152 is part of a fixed disk To simplify addressing of parameters and local variables, 

drive 151 which is connected to bus 130 by controller 150. an additional address, the "frame pointer", typically kept in 

User input to computer system 100 may be provided by a a machine register, indicates start of the parameter area in 

number of devices. For example, a keyboard 156 and mouse 40 the current stack frame. Instructions can then access a 

157 are connected to bus 130 by controller 155. An audio parameter or local variable in the current frame by specify- 

transducer 196, which may act as both a microphone and a ing an offset from the current frame pointer, 

speaker, is connected to bus 130 by audio controller 197, as When a subroutine or method is to be invoked, some 

illustrated. It will be obvious to those reasonably skilled in items at the top of the evaluation stack become parameters 

thc art that other input devices, such as a pen and/or tabloid 45 within the new frame. The current frame pointer is saved on 

may be connected to bus 130 and an appropriate controller the stack along with the program counter. The frame pointer 

and software, as required. DMA controller 160 is provided is then loaded with the address of the new frame while the 

for performing direct memory access to RAM 110. A visual program counter is loaded with the address of the code for 

display is generated by video controller 165 which controls the subroutine. 

video display 170. Computer system 100 also includes a 50 Some computing systems are "multi-threaded" in which 

communications adaptor 190 which allows the system to be multiple ongoing computational processes share a single 

interconnected to a local area network (LAN) or a wide area address space or a single pool of objects. Such a set of 

network (WAN), schematically illustrated by bus 191 and processes typically has more than one stack or one stack for 

network 195. each process. Such multiple processes can be served by a 

Operation of computer system 100 is generally controlled 55 single storage management strategy, 

and coordinated by operating system software, such as the Segregated Stack Architecture 

OS/2® operating system, available from International Busi- A central feature of this invention is to implement every 

ness Machines Corporation, Boca Raton, Fla. The operating stack as two substacks, one to contain references and one to 

system controls allocation of system resources and performs contain primitive data. In this manner, whether a piece of 

tasks such as processing scheduling, memory management, 60 information on a stack is a reference or a primitive value is 

networking, and I/O services, among things. easily determined according to which substack it resides in. 

In conventional computing systems, an ongoing compu- Each substack is itself a full-fledged stack data structure 

tation uses a "stack" to support procedure calls and to hold with a stack base, a stack pointer, and a stack limit. In the 

intermediate computational quantities such as references, preferred embodiment, there is also a frame pointer for each 

and primitive values. Astack comprises an unused portion of 65 substack. 

internal memory which reduces the number of registers With the inventive technique described herein, a garbage 

needed for temporary storage and decreases the number of collection algorithm can easily locate all references stored 
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within a stack, since all references are in a more or less 
contiguous region of memory without interspersed primitive 
data. In the preferred embodiment as conceptually illustrated 
in FIG. 2, the two substacks representing a single stack 
occupy the same linear, contiguous region of memory that 
otherwise would have been dedicated to the single stack. 
The reference stack grows "downward" while the primitive 
stack grows "upward" so that the two stacks grow toward 
each other. One end of the linear contiguous memory region 
is indicated by the "reference stack base" and the other end 
by the "primitive stack base". In the preferred embodiment, 
there are two stack pointers. For the purposes of the stack 
push and pop operations, the reference stack pointer serves 
as the primitive stack limit, and the primitive stack pointer 
serves as the reference stack limit. In this manner, memory 
is not further fragmented since there is only one region of 
"unused memory", not two. 

FIG. 3 illustrates a program data stack having contents 
segregated into a pair of substacks in accordance with a first 
embodiment of the present invention. As with the conven- 
tional stack data on the stack is grouped into "frames", as 
illustrated in FIG. 3. Each frame corresponds to one level of 
subroutine invocation or method invocation. Every frame is 
divided into three regions: parameters, local variables, and 
evaluation temporaries. Parameters are closest to the stack 25 
base while the evaluation temporaries are furthest from the 
stack base. There are two sets of frames, one corresponding 
to the reference stack and one corresponding to the primitive 
stack. As the reference stack grows, the reference stack 
pointer moves downward and, as the primitive stack grows 
the primitive stack pointer moves upward. The unused 
memory space is located between the two stacks. 

With the segregated stack of the present invention, every 
operation on a stack that is to be performed by the ongoing 
computation must use the appropriate substack depending 
on whether the data to be used or transferred is reference 
data or primitive data. For example, the "iadd" instruction 
pops two operands from the stack, which are integers i.e. not 
references. Their sum is then pushed back onto the stack as 
an integer. Using the inventive two-stack technique, the 
"iadd" instruction is redefined to pop two operands from the 
PRIMITIVE stack. Their sum is then pushed back onto the 
PRIMITIVE stack as an integer. 

Similarly, the "aaload" instruction pops two items from 
the stack, which must be a reference to an array of references 
and an integer index. The instruction selects a reference 
from the array as indicated by the index value and then 
pushes a copy of the selected reference onto the stack. Using 
the inventive two-stack technique, the "aaload" instruction 
is redefined to pop one item from the REFERENCE stack, 
which must be a reference to an array of references, and to 
pop one item from the PRIMITIVE stack, an integer index. 
The instruction selects a reference from the array as indi- 
cated by the index value, and then pushes a copy of the 
selected reference onto the REFERENCE stack. 

Most of the other Java Virtual Machine bytecode instruc- 
tions may be similarly redefined so as to use the appropriate 
substack for each operand or result according to whether it 
is a reference value or a primitive value. A description of the 
redefinition of such other bytecode instructions according to 
the Java virtual machine specification will be within the 
scope of those skilled in the art in light of the description 
above and will not be provided herein. 

The following Java Virtual Machine bytecode instructions 
may be additionally be redefined to use one of two frame 
pointers: 



30 
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lload 


lload_0 


lload_l 


lload_2 


lload_3 


noad 


fload_0 


fload_l 


fload_2 


fload_3 


lload 


lload_0 


lload_l 


lload_2 


lload_3 


dload 


dload_0 


dload_l 


dload_2 


dbad_3 


aload 


aload_0 


a!oad_l 


a!oad_2 


aload_3 


is tore 


istore_0 


istore_l 


istore_2 


istore__3 


f store 


fstore_0 


fstore_l 


fstore_2 


fstore_3 


lstore 


lstore_0 


lstore_l 


lstorc_2 


1store__3 


dstore 


dstore__0 


dstore_l 


dstoie_2 


dstore__3 


astore 


dstore_0 


dstore_l 


dstore_2 


astore_3 
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Those whose name begin with "a" deal with reference 
values and therefore use the reference frame pointer. The 
others deal with primitive values and therefore use the 
primitive frame pointer. 

It is not possible, in this framework, to use instructions 
that are indiscriminate as to whether a stack operand is a 
reference or primitive data. Therefore, additional instruc- 
tions and two mechanisms are required, which are first 
summarized here and then elaborated upon below. 

For every instruction, whether an ordinary JVM instruc- 
tion or implementation-dependent "quick" instruction, 
if that instruction is indiscriminate, then the instruction 
is replaced by new instructions which are added to the 
instruction set In this manner, all instructions in the 
instruction set perform the relevant discriminations. 
The verifier is altered so that, as it traverses the code of 
a method (before the code for that method has ever 
been executed), whenever it examines any particular 
instance of a nominally indiscriminate instruction and 
determines which of its operands and results will be 
references, it replaces that instruction at verification 
time with one of several instructions that are logically 
equivalent but distinguish, for each stack operand or 
result, whether or not that operand or result is a 
reference, (In an alternate embodiment of the invention, 
a compiler might generate the necessary specialized 
instructions directly.) 
The mechanism that replaces slower instructions with 
quick instructions after symbolic name resolution is 
also modified so it chooses one of several possible 
quick instructions as the replacement. The choice of 
replacement instructions is made in such a manner to 
avoid losing information which would identify a par- 
ticular stack operand or result as a reference. 
Modified Instruction Set 

The standard JVM instructions pop, pop2, dup, dup2, 
dup_xl, dup_x2, dup2_xl, dup2_x2, swap are construed 
always to operate on primitive values, using the primitive 
stack. For reasons of naming symmetry, explained below, 
each is given one or more additional names. Their instruc- 
tion descriptions are therefore revised as follows. In every 
instruction description: 

Let vl mean the topmost item on the primitive stack. 
Let v2 mean the second item from the top on the primitive 
stack. 

Let v3 mean the third item from the top on the primitive 
stack. 

Let v4 mean the fourth item from the top on the primitive 
stack. 



pop - vpop 

Pop vl from the primitive stack (and discard). 
pop2 - wpop 
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-continued 

Pop vl from the primitive stack (and discard), then 

pop v2 from the primitive stack (and discard), 
dup - vdup - a__vdup - aa_vdup 

Push a copy of vl onto the primitive stack. 5 
dup2 - wdup - a_wdup - va_vdup 

Pop vl from the primitive stack, then 

pop v2 from the primitive stack, then 

push a copy of v] onto the primitive stack, then 

push a copy of v2 onto the primitive stack, then 

push another copy of vl onto the primitive stack. io 
dup_x2 « w_vdup 

Pop vl from the primitive stack, then 

pop v2 from the primitive stack, then 

pop v3 from the primitive stack, then 

push a copy of vl onto the primitive stack, then 

push a copy of v3 onto the primitive stackj then ^ 

push a copy of v2 onto the primitive stack, then 

push another copy of vl onto the primitive stack. 
dup2_xl = v__vvdup = av_wdup = va_vvdup 

Pop vl from the primitive stack, then 

pop v2 from the primitive stack, then 

pop v3 from the primitive stack, then 

push a copy of v 2 onto the primitive stack, then 

push a copy of vl onto the primitive stack, then 

push a copy of v3 onto the primitive stack, then 

push another copy of v2 onto the primitive stack, then 

push another copy of vl onto the primitive stack. 
dup2_x2 = w_wdup 

Pop vl from the primitive stack, then 25 

pop v2 from the primitive stack, then 

pop v3 from the primitive stack, then 

pop v4 from the primitive stack, then 

push a copy of v2 onto the primitive stack, then 

push a copy of vl onto the primitive stack, then 

push a copy of v4 onto the primitive stack, then 30 

push a copy of v3 onto the primitive stack, then 

push another copy of v2 onto the primitive stack, then 

push another copy of vl onto the primitive stack, 
swap - wswap 

Pop vl from the primitive stack, then 

pop v2 from the primitive stack, then ^5 
push a copy of vl onto the primitive stack, then 
push a copy of v2 onto the primitive stack. 



Also, the instruction "nop" is given two additional names: 



nop = avswap = vaswap 
Take no action. 



Certain of these instructions have more than one name, 45 
again for naming symmetry. To fully utilize the advanta- 
geous design of the segregated stack in accordance with the 
present invention, a number of new instructions are pro- 
posed as part of the invention. 

The behavior of the new additional instructions is as 50 
follows. In every instruction description: 

Let al mean the item that is topmost on the reference 

stack when the instruction begins execution. 
Let a2 mean the item that is second from the top on the 55 

reference stack when the instruction begins execution. 
Let a3 mean the item that is third from the top on the 
reference stack when the instruction begins execution. 
Let a4 mean the item that is fourth from the top on the 

reference stack when the instruction begins execution. 60 
Let vl mean the item that is topmost on the primitive 

stack when the instruction begins execution. 
Let v2 mean the item is second from the top on the 

primitive stack when the instruction begins execution. 65 
Let v3 mean the item that is third from the top on the 
primitive stack when the instruction begins execution. 



10 



apop 

Pop al from the reference stack (and discard), 
aapop 

Pop al from the reference stack (and discard) and 
pop vl from the primitive stack (and discard). 

avpop - vapop 

Pop al from the reference stack (and discard) and 
pop vl from the primitive stack (and discard). 

adup = v_adup = w_adup 

Push a copy of al onto the reference stack. 

aadup - v_aadup ■= w„aadup 

Push a copy of a2 onto the reference stack, then 
push a copy of al onto the reference stack. 

avdup = vadup 

Push a copy of al onto the reference stack, and 
push a copy of vl onto the primitive stack. 

a_adup ° av adup = va_adup 

Pop al from the reference stack, then 

pop a2 from the reference stack, then 

push a copy of al onto the reference stack, then 

push a copy of a2 onto the reference stack, then 

push another copy of al onto the reference stack. 

o_aadup - av__aadup - va_aadup 

Pop al from the reference stack, then 
pop a2 from the reference stack, then 
pop a3„ from the reference stack, then 
push a copy of a2 onto the reference stack, then 
push a copy of al onto the reference stack, then 
push a copy of a3 onto the reference stack, then 
push another copy of a2 onto the reference stack, then 
push another copy of al onto the reference stack. 

a_avdup = a vadup 

Pop al from the reference stack, then 

pop a2 from the reference stack, then 

push a copy of al onto the reference stack, then 

push a copy of a2 onto the reference stack, then 

push another copy of al onto the reference stack; and 

push a copy of vl onto the primitive stack. 

v_avdup - v_vadup 

Pop vl from the primitive stack, then 

pop v2 from the primitive stack, then 

push a copy of vl onto the primitive stack, then 

push a copy of v2 onto the primitive stack, then 

push another copy of vl onto the primitive stack; and 

push a copy of al onto the reference stack. 

aa_adup 

Pop al from the reference stack, then 

pop a2 from the reference stack, then 

pop a3 from the reference stack, then 

push a copy of al onto the reference stack, then 

push a copy of a3 onto the reference stack, then 

push a copy of a2 onto the reference stack, then 

push another copy of al onto the reference stack. 

aa_aadup 

Pop al from the reference stack, then 

pop a2 from the reference stack, then 

pop a3 from the reference stack, then 

pop a4 from the reference stack, then 

push a copy of a2 onto the reference stack, then 

push a copy of al onto the reference stack, then 

push a copy of a 4 onto the reference stack, then 

push a copy of a3 onto the reference stack, then 

push another copy of a2 onto the reference stack, then 

push another copy of al onto the reference stack. 

aa_avdup *» aa vadup 

Pop al from the reference stack, then 
pop a2 from the reference stack, then 
pop a3 from the reference stack, then 
push a copy of al onto the reference stack, then 
push a copy of a3 onto the reference stack, then 
push a copy of a2 onto the reference stack, then 
push another copy of al onto the reference stack; and 
push a copy of vl onto the primitive stack. 

w_avdup - w_vodup 

Pop vl from the primitive stack, then 

pop v2 from the primitive stack, then 

pop v3 from the primitive stack, then 

push a copy of vl onto the primitive stack, then 

push a copy of v3 onto the primitive stack, then 

push a copy of v2 onto the primitive stack, then 
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push another copy of vl onto the primitive stack; and 
push a copy of a 1 onto the reference stack. 

av_avdup - va_avdup - av_vadup - va__vadup 
Pop al from the reference stack, then 
pop a2 from the reference stack, then 
push a copy of al onto the reference stack, then 
push a copy of a2 onto the reference stack, then 
push another copy of al onto the reference stack; and 
pop vl from the primitive stack, then 
pop v2 from the primitive stack, then 
push a copy of vl onto the primitive stack, then 
push a copy of v2 onto the primitive stack, then 
push another copy of vl onto the primitive stack. 

aaswap 

Pop al from the reference stack, then 

pop a2 from the reference stack, then 

push a copy of al onto the reference stack, then 

push a copy of a2 onto the reference stack. 



15 



In the preferred embodiment whenever the verifier exam- 
ines an instance of any of the pop, pop2, dup, dup2, dup_xl, 
dup_x2, dup2_„xl, dup2_x2, or swap instructions the veri- 
fier will have determined for each item on the stack whether 
that item will always be a reference or always be a primitive 
value when that instruction is executed. The verifier then 
forms the name of a replacement instruction as follows: 

(a) For pop or pop2, use the base name "pop". 

For dup, dup2, dup_xl, dup_x2, dup2_xl, or dup_ 

x2, use the base name "dup". 
For swap, use the base name "swap". 

(b) For pop or dup, prefix one letter to the base name, 
representing the topmost stack item. 

For pop2, dup_xl, dup2, or swap, prefix two letters to 
the base name, representing the two topmost stack 
items, with the second letter of the resulting name 
representing the topmost stack item and the first 
letter of the resulting name representing the next 
stack item. 

For dup_x2 or dup2_xl, prefix three letters to the base 
name, representing the three topmost stack items, 
with the third letter of the resulting name represent- 
ing the topmost stack item, the second letter of the 
resulting name representing the next stack item, and 
the first letter of the resulting name representing the 
item below that. 

For dup2„x2, prefix four letters to the base name 
representing the four topmost slack items, with the 
fourth letter of the resulting name representing the 
topmost stack item, the third letter of the resulting 
name representing the next stack item, the second 
letter of the resulting name representing the next 
stack item, and the first letter of the resulting name 
representing the item below that. 

(c) For pop, pop2, dup, dup2, or swap, no further action 
is necessary. 

For dup_xl or dup2 xl, insert the character "__" 

between the first and second letters of the name 
resulting from step (b). 
For dup_x2 or dup2_x2, insert the character "_" 
between the second and third letters of the name 
resulting from step (b). 
The instruction so named is then used to replace the 
original instruction. Note that the replacement instruction 
may be the same as the original instruction, under another 
name. The names of these instructions are chosen so that the 
letters "a" and "v" in the prefix form a picture of the stack 
to be operated upon, considering the stack in its logical, 
single-stack form. The letter for the topmost stack item is 



nearest the base name of the operation ("pop", "dup", or 
"swap"). If "_" is present in the name, it separates items 
acted upon by the instruction from items that are skipped 
over. The items acted upon are adjacent to the operation 
name "pop", "dup", or "swap". 
Instruction Mapping Mechanisms 

Alternatively, rather than the verifier performing the pro- 
cess steps of constructing the instruction names as described 
above, replacement of names can be performed through the 
use of a decision table or other mechanisms for mapping an 
instruction name and a set of binary values to a new 
instruction name, A decision table in accordance with the 
illustrative embodiment of the invention which is capable of 
performing the mapping of instruction names to a new 
instruction is illustrated in Table 1. Utilization of a decision 
table as illustrated in Table 1 eliminates the need for 
redundant names for instructions. In Table 1, an "X" signi- 
fies a "don't care condition. 
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TABLE 1 


original 


fourth 


third 


next 


top of 


replacement 


instruction 


item 


item 


to top 


stack 


instruction 


pop 


X 


X 


x 


a 


apop 


pop 


X 


X 


x 


V 


pop 


pop2 


X 


X 


a 


a 


aapop 


pop2 


X 


X 


V 


a 


avpop 


pop2 


. X 


X 


a 


V 


avpop 


pop2 


X 


X 


V 


V 


pop2 


dup 


X 


X 


X 


a 


adup 


dup 


X 


X 


X 


V 


dup 


dup2 


X 


X 


a 


a 


aadup 


dup2 


X 


X 


V 


a 


avdup 


dup2 


X 


X 


a 


V 


avdup 


dup2 


X 


X 


V 


V 


dup2 


dup_jcl 


X 


X 


a 


a 


a adup 


dup__xl 


X 


X 


V 


a 


adup 


dup_xl 


X 


X 


a 


V 


dup 


dup_xl 


X 


X 


V 


V 


dup__xl 


dup_x2 


X 


a 


a 


a 


aa_adup 


dup_x2 


X 


V 


a 


a 


a_adup 


dup_x2 


X 


a 


V 


a 


a_adup 


dup_ x2 


X 


V 


V 


a 


adup 


dup_jt2 


X 


a 


a 


V 


dup 


dup_x2 


X 


V 


a 


V 


dup__xl 


dup_x2 


X 


a 


V 


V 


dup_xl 


dup_x2 


X 


V 


V 


V 


dup__x2 


dup2__xl 


X 


a 


a 


a 


a_adup 


dup2_xl 


X 


V 


a 


a 


aadup 


dup2„xl 


X 


a 


V 


a 


a_avdup 


dup2__xl 


X 


V 


V 


a 


v_avdup 


dup2_jcl 


X 


a 


a 


V 


a_avdup 


dup2_xl 


X 


V 


a 


V 


v__avdup 


dup2_xl 


X 


a 


V 


V 


dup2 


dup2_xl 


X 


V 


V 


V 


dup2_xl 


dup2_x2 


a 


a 


a 


a 


aa_aadup 


dup2_x2 


V 


a 


a 


a 


a aadup 


dup2_x2 


a 


V 


a 


a 


a_aadup 


dup2__x2 


V 


V 


a 


a 


aadup 


dup2_x2 


a 


a 


V 


a 


aa_avdup 


dup2_x2 


V 


a 


V 


a 


av_avdup 


dup2_x2 


a 


V 


V 


a 


av_avdup 


dup2„x2 


V 


V 


V 


a 


w_avdup 


dup2_x2 


a 


a 


a 


V 


aa_avdup 


dup2_x2 


V 


a 


a 


V 


av_avdup 


dup2__x2 


a 


V 


a 


V 


av_avdup 


dup2__x2 


V 


V 


a 


V 


vv _avdup 


dup2_x2 


a 


a 


V 


V 


dup2 


dup2_jc2 


V 


a 


V 


V 


dup2_jcl 


dup2__x2 


a 


V 


V 


V 


dup2_xl 


dup2_x2 


V 


V 


V 


V 


dup2_x2 


swap 


X 


X 


a 


a 


aaswap 


swap 


X 


X 


V 


a 


nop 


swap 


X 


X 


a 


V 


nop 


swap 


X 


X 


V 


V 


swap 
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In the preferred embodiment of this invention, Java byte- 

code instruction set opcodes may be assigned to the new 

instructions as follows: 



237 


apop 


238 


aapop 


239 


avpop 


240 


aaswap 


241 


adup 


242 


aadup 


243 


avdup 


244 


a_adup 


245 


a_aadup 


246 


a_avdup 


247 


v_avdup 


248 


aa_adup 


249 


aa_aadup 


250 


aa_avdup 


251 


av_avdup 


252 


w_avdup 



(2) The semi-standard JVM quick instructions 

getfield_quick 

getfield„quick_w 

getstatic_quick 

ldc__quick 

ldc_w__quick 

putfield_quick 

putfie ld_quick_w 

putstatic__quick 
are construed always to operate on primitive values, using 
the primitive stack. The invention contemplates these addi- 
tional quick instructions, which are exactly like the corre- 
sponding existing instructions without "a" beginning their 
name, but operate on reference values, using the reference 
stack: 

agetfield_quick 

agetfield__quick__w 

agetstatic quick 

aldc_quick 

aldc_w_quick 

aputfield_quick 

aputfield_quick_w 

aputstatic_quick 

Whenever a slower instruction such as getfield, getstalic, 
ldc, ldc_w, putfield, or putstatic is to be replaced by a quick 
instruction during the process of executing the slower 
instruction, a quick instruction whose name begins with "a" 
is chosen if the symbolic name resolution process deter- 
mines that the datum to be transferred to or from the stack 
will be a reference; otherwise the usual quick instruction is 
chosen. The instruction agetfield„_quick is identical in its 
function to getfield_quick, except in that agetfield_quick 
uses the reference stack. Similar analogies are for the other 
seven cited quick instruction types. 

In the preferred embodiment of this invention, Java byte- 
code instruction set opcodes may be assigned to the new 
instructions as follows: 



232 getfield_ieference_quLck 

233 getfield_reference_quick_w 

234 getstatic„reference_quick 

235 ldc„reference_quick 

236 ldc_reference„w__ 

231 putfield_reference_quick 



14 

-continued 

229 putfield_reference_quick_w 

230 putstatic_reference_quick 



Space Optimization Technique 

An additional optimization technique is contemplated 
herein, which is not necessary for implementation of the 
invention but which improves space utilization. This tech- 
nique is carried out by the verifier at the same time that the 
verifier is replacing pop, dup, and swap operations with 
more specialized versions while analyzing the code for a 
method. The objective of the optimization is to reduce the 
space needed on the stacks for local variables. The unopti- 
mized technique simply reserves space for all the local 
variables of a method on both the primitive stack and the 
reference stack. This uses twice as much space for the local 
variables as would be necessary with the usual single-stack 
representation, but has the advantage that instructions that 
access local variables can be used unchanged. 

The optimization technique is implemented with an 
inventive method which first analyzes, for each local vari- 
able slot, whether the slot is ever used to hold a reference 
value and whether it is ever used to hold a primitive value. 
A local variable in the JVM may be used to hold a primitive 
value in some parts of the code for a method and to hold a 
reference value in other parts of the code for the same 
method. The slots are then gathered into two sets that are not 
necessarily disjoint, i.e. a first set of all local variable slots 
used for primitive values and a second set of all local 
variable slots used for reference values. In the illustrative 
embodiment, let P be the number of slots in the first set and 
let R be the number of slots in the second set. 

The slots in each set are renumbered so as to have 
consecutive numbers starting with zero. Specifically, for 
each set, for each slot S in that set, if the set contains N other 
slots whose original offset is less than the original offset of 
slot S, then slot S is renumbered to have offset N. This 
particular renumbering strategy advantageously allows slots 
that originally have consecutive offsets to be renumbered so 
as to have consecutive offsets in the same order. Next, once 
the slots are renumbered, every occurrence in the code of 
any of the following instructions is rewritten by the JVM 
verifier to refer to the new location of the stack slot, relative 
to the appropriate frame pointer, and according to the 
renumbering: 



iload 
fload 

50 lload 
dload 
a Load 
istore 
fstore 
Istore 

55 dstore 
astore 



is rewritten by the verifier to refer to the new location of the 
stack slot, relative to the appropriate frame pointer, accord - 

60 ing to the renumbering. 

Next, the results are annotated so that when the method is 
invoked, P slots are allocated for local variables on the 
primitive stack and R slots are allocated for local variables 
on the reference stack. The amount of excess memory used 

65 for local variables will then be equal to the size of the 
intersection of the two sets; if this intersection is empty, 
because no local variable slot is ever used for a reference 



iload_0 
fload_0 
lload_0 
dload_0 
aload_0 
istore_0 
fstore_0 
lstore_0 
dstore_0 
astore__0 



iload_l 
fload__l 
Uoad_l 
dload_l 
aload_l 
istore_l 

lstore_l 



iload_2 
fload_2 
lload_2 
dload_2 
aload_2 
istore_2 
fstore__l 
lstore„2 
dstore_J 
astore_l 



iIoad_3 
fload„3 
lload_3 
dload_3 
aload_3 
istore_3 
fstore_2 
lstore_J 
dstore„2 
astore_2 



fstore_3 

dstore_3 
astore_3 
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value and also for a primitive value, then no excess memory 
will be allocated for that method. 

For the reader's benefit, a specific example of the inven- 
tive space optimization method is provided. Suppose that a 
method uses six local variable slots. Slots 0, 3, and 4 are 
each used at certain points in the code of the method to hold 
a reference value. Slots 1 and 2 are used together at certain 
points in the code of the method to hold a double-precision 
floating-point value. Slots 1, 3, and 5 are each used at certain 
points in the code of the method to hold an integer value. 
The verifier, as it traverses the code for the method, deter- 
mines the following slot information: 



o 

reference 



double - floating 
integer 



3 

reference 
- point 
integer 



4 

reference 



integer 



The verifier then constructs the set of local variable slots 
for reference values {0, 3, 4} and the set of local variable 
slots for primitive values {1, 2, 3, 5}. The reference value 
slots are renumbered: 

3^1 
4^2 

and the primitive value slots are renumbered 

2- *l 

3- >2 
5->3 

Note that the consecutive slots 1 and 2, used to hold a double 
floating point value, are renumbered to consecutive slots 0 
and 1. 

Next, for example, an instruction sequence such as 
iload_3 
istore 5 
dload_l 
aload 4 
astore_3 

would be written by the verifier, using the slot renumberings, 
as follows: 



iload_2 


3 


-* 2 in primitive renumbering 


istore 3 


5 


-* 3 in primitive renumbering 


dload_0 


1 


-+ 0 in primitive renumbering 


aload 2 


4 


-* 2 in reference renumbering 


astore_l 


3 


-» 1 in reference renumbering 



When the method is invoked, four slots are allocated for 
local variables on the primitive stacks and three slots are 
allocated for local variables on the reference stack, for a total 
of seven slots. This total exceeds the six slots needed in the 
one-stack representation by one slot, which is the size of the 
intersection of the two sets. If the optimization technique 
were not used, six slots would be allocated on each of the 
two substacks for a total of 12 slots. 

Note, a short one -byte instruction, such as iload_2, can 
always be rewritten as a short instruction, because the 
renumbering gives a slot a new offset that is not greater than 
its old offset. Also, the same offset (3, in this example) can 
be rewritten to different offsets depending on whether the 
instruction handles reference data or primitive data. 

FIG. 4 is a flowchart detailing an illustrative routine 
which computes slack sizes and sets up the program sub- 
stacks in accordance with the principles of the invention. 



Although a single flowchart is shown, it should be under- 
stood that, depending on the implementation, some of the 
illustrated steps may be performed at different times. For 
example, instruction replacement as detailed in steps 

5 402-406 may be performed at compilation or loading time, 
whereas stack allocation (steps 408-432) is generally per- 
formed during loading. The actual set-up of the stacks as set 
forth in steps 434-436 is performed by the operating system 
at the time of initial execution. 

no The routine shown in FIG. 4 begins in step 400 and 
proceeds to step 402, where each instruction, or bytecode, is 
examined. As previously mentioned, this examination can be 
performed during program load by the JVM verifier func- 
tion. The instructions are examined to determine whether 

15 they are reference specific, primitive data specific or indis- 
criminate. In step 404, a decision is made based on the 
examination in step 402 whether the instruction is indis- 
. criminate. 

If the instruction is indiscriminate, the routine proceeds to 

20 step 406, where the indiscriminate instruction is replaced 
with one or more discriminate instructions as previously 
described. Alternatively, if, in step 404, it is determined that 
the instruction discriminates between reference values and 
primitive data, then the routine proceeds directly to step 408. 

25 In step 408, a check is made to determine whether the 
instruction under examination reads or writes primitive data 
into a destination variable. If the instruction does read or 
write primitive data, then the destination variable is labeled 
as a primitive variable in step 410. Otherwise, the routine 

30 proceeds to step 412. 

In step 412, a check is made to determine whether the 
instruction under examination reads or writes a reference 
into a destination variable. If the instruction does read or 
write a reference, then the destination variable is labeled as 

35 a reference variable in step 414, Otherwise, the routine 
proceeds to step 416. 

In step 416, a check is made to determine whether any 
more instructions in the program stream remain. If there are 
remaining instructions, the routine returns to step 402 to 

40 examine the next instruction. If no instructions remain, the 
routine then proceeds to renumber the variables in steps 
418-420 and then alter the instructions to use the renum- 
bered variable in steps 422-432. Finally, stack slots are 
allocated in steps 434 and 436. 

45 In particular, in step 418, the number of primitive vari- 
ables (P) is determined from the labeled variables, and the 
primitive variables are renumbered as described above. 
Next, in step 420, the number of reference variables (R) is 
determined, and these are renumbered. It should be noted 

50 that it is possible for the same local variable to be labeled as 
a primitive variable, a reference variable or both. 

Subsequently, in steps 422-432, a second pass is made 
over the program instructions to alter the instruction to use 
the renumbered variable values. In particular, the instruc- 

55 tions are re-examined in step 422. In step 424, a check is 
made to determine whether the instruction under examina- 
tion reads or writes primitive data into a destination variable. 
If the instruction does read or write primitive data, then the 
instruction is altered to use the renumbered value of the 

60 primitive destination variable in step 426. Otherwise, the 
routine proceeds to step 428. 

In step 428, a check is made to determine whether the 
instruction under examination reads or writes a reference 
into a destination variable. If the instruction does read or 

65 write a reference, then, in step 430, the instruction is altered 
to use the renumbered reference variable number. 
Otherwise, the routine proceeds to step 432. If step 432 
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determines that additional instructions remain to be exam- 
ined the routine proceeds back to step 422. 

Next, in steps 434 and 436, the first and second sub-stacks 
are defined using the allocations made in steps 418 and 420. 
More particularly, in step 434, P slots are allocated for 
primitive local variables are pushed onto a first substack 
using the first substack pointer as the stack pointer and the 
second substack pointer as the first stack limit as previously 
described. Next, in step 436, R slots for the reference local 
variables are pushed onto a second sub-stack with and the 
second sub-stack limit and points set as described above. 
The routine then ends in step 438. 

In an alternate embodiment of this work, all stack items 
might be capable of holding a single primitive datum, even 
long and double quantities. For example, all stack items 
would be 64 bits, and reference could be as large as 64 bits 
rather than 32 bits. In this case memory would be wasted 
when a single stack slot were used to hold a 32-bit integer 
or floating-point datum, or two stack slots were used to hold 
a long or double datum. 

In yet another alternate embodiment, the invention may 
be implemented as a computer program product for use with 
a computer system. In such an implementation, all the 
functions performed may be implemented with correspond- 
ing processor instructions. Any number of programming 
languages including most processor instruction sets, assem- 
bly languages and the C programming language provide 
facilities which allow for the manipulation of individual bits 
within a data word, and would be suitable for implementa- 
tion of such and all software embodiment of the invention. 
Such implementation may comprise a series of computer 
instructions either fixed on a tangible medium, such as a 
computer readable media, e.g. diskette, CD-ROM, ROM, or 
fixed disk, or transmitfable to a computer system, via a 
modem or other interface device, such as communications 
adapter connected to a network over a medium. Medium can 
be either a tangible medium, including but not limited to 
optical or analog communications lines, or may be imple- 
mented with wireless techniques, including but not limited 
to microwave, infrared or other transmission techniques. 
The series of computer instructions embodies all or part of 
the functionality previously described herein with respect to 
the invention. Those skilled in the art will appreciate that 
such computer instructions can be written in a number of 
programming languages for use with many computer archi- 
tectures or operating systems. Further, such instructions may 
be stored using any memory technology, present or future, 
including, but not limited to, semiconductor, magnetic, 
optical or other memory devices, or transmitted using any 
communications technology, present or future, including but 
not limited to optical, infrared, microwave, or other trans- 
mission technologies. It is contemplated that such a com- 
puter program product may be distributed as a removable 
media with accompanying printed or electronic 
documentation, e.g., shrink wrapped software, preloaded 
with a computer system, e.g., on system ROM or fixed disk, 
or distributed from a server or electronic bulletin board over 
a network, e.g., the Internet or World Wide Web. 

Although various exemplary embodiments of the inven- 
tion have been disclosed, it will be apparent to those skilled 
in the art that various changes and modifications can be 
made which will achieve some of the advantages of the 
invention without departing from the spirit and scope of the 
invention, such modifications to the inventive concept are 
intended to be covered by the appended claims. 

What is claimed is: 

1. In a computer system having a processor and a memory, 
a method for discriminating whether a datum useful in 
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execution of program instructions by the processor is one of 
a plurality of operand types, the method comprising the steps 
of: 

A. defining a first area in the memory as a first program 
data stack capable of accommodating a datum of a first 
operand type, the first program data stack having a first 
stack base address and a first stack limit address which 
define the extent of the first program data stack; 

B. defining a second area in the memory as .a second 
program data stack capable of accommodating datum 
of a second operand type, the second program data 
stack having a second stack base address and a second 
stack limit address which- define the extent of the 
second program data stack; 

C. storing data of the first operand type within the first 
program data stack, and defining a first stack pointer 
which identifies the current top of the first data stack, 
the first data stack pointer having an address value 
equal to the second stack limit; and 

D. storing data of the second operand type within the 
second program data stack, and defining a second 
program slack pointer which identifies the current top 
of the second data stack, the second stack pointer 
having an address value equal to the first stack limit. 

2. The method of claim 1, wherein the first program stack 
base and the second program stack base define a linear, 
contiguous region of the memory. 

3. The method of claim 1, wherein data operands of the 
first type are references. 

4. The method of claim 1, wherein operands of the second 
data type are primitive values. 

5. A method for discriminating between first and second 
operand data types in a program stack, the first and second 
operand data types being useful in execution of a program 
instruction, the method comprising the steps of: 

A. identifying indiscriminate program instructions in a 
sequence of instructions, which indiscriminate program 
instructions operate with both the first and the second 
operand data types; 

B. replacing each indiscriminate program instruction with 
at least one discriminate program instruction, which 
discriminate program instruction operates with one of 
the first and the second operand data types; 

C. storing operand data associated with a discriminating 
instruction that operates with the first operand data type 
in a first program data stack; and 

D. storing operand data associated with a discriminating 
instruction that operates with the second operand data 
type in a second program data stack. 

6. The method of claim 5, wherein step B comprises the 
step of: 

B.l mapping an indiscriminate program instruction iden- 
tifier to a replacement discriminate program instruction 
identifier by using a look-up table. 

7. The method of claim 5, wherein step B comprises the 
step of: 

B.2 mapping an indiscriminate program instruction iden- 
tifier to a replacement discriminate program instruction 
identifier during instruction verification. 

8. A computer system comprising: 
a processor; 

a memory coupled to the processor; 

program stack logic, operatively coupled to the processor 
and the memory and configured to store a first type of 
operand datum in a first plurality of stack entries, and 
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to store a second type of operand datum in a second 
plurality of stack entries; 
the first plurality of stack entries forming a first program 
substack defined by a first substack base address, a first 
substack limit address, and a first substack pointer 5 
address; 

the second plurality of stack entries forming a second 
program substack defined by a second substack base 
address, a second substack limit address, and a second 
substack pointer address; and 

wherein the first substack pointer address and the second 
substack limit address are the same address. 

9. The computer system of claim 8, wherein the second 
substack pointer address and the first substack limit address 35 
are the same address. 

10. A computer program product for use with a computer 
system having a processor and a memory coupled to the 
processor, the computer system configured to store operand 
datum in a plurality of stack entries and to manipulate the 2Q 
stack entries during program instruction execution, the com- 
puter program product comprising a computer usable 
medium having program code embodied in the medium for 
enabling discrimination of stack entries as one of a plurality 

of predefined operand types, the program code comprising: 25 
program code configured to define a first data stack to 
accommodate operand datum of a first predefined oper- 
and type, the first data stack having a first stack base 
address, first stack limit address, and a first stack 
pointer defining the current top of the first stack; 30 
program code configured to define a second data stack to 
accommodate operand datum of a second predefined 
operand type, the second data stack having a second 
stack base address, second stack limit address, and a 
second stack pointer defining the current top of the 35 
second stack; and 
the first stack limit address being the same address as the 
second stack pointer address. 

11. The computer program product of claim 10 wherein 
the second stack limit address is the same as the first stack 40 
pointer address. 

12. The computer program product of claim 10 further 
comprising program code configured to store operand datum 
of the first type in the first data stack; and 

program code configured to store operand datum of the 
second operand type in the second data stack. 

13. The computer program product of claim 10, further 
comprising: 

program code, responsive to an indiscriminate program 5Q 
instruction which accommodates operand data of more 
than one predefined type, configured to convert the 
indiscriminate program instruction into discriminate 
instructions which manipulate operand data of a single 

type- 55 

14. The computer program product of claim 10, wherein 
operand data of the first type are references. 

15. The computer program product of claim 10, wherein 
operand data of the second type are primitive values. 

16. A method for allocating memory slots for a program 
data stack, the program data stack useful for accommodating 
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a plurality of operand data types during execution of a 
program instruction, the method comprising the steps of: 

A. determining from a sequence of program instructions, 
a first number representing program data stack entries 
required to accommodate datum of a first operand data 
type; 

B. determining from a sequence of program instructions, 
a second number representing program data stack 
entries required to accommodate datum of a second 
operand data type; and 

C. allocating a total number of slots for the program data 
stack equal to a sum of the first number and the second 
number. 

17. The method of claim 16 further comprising the step of: 

D. analyzing each instruction in the instruction sequence 
to determine the operand data types with which the 
instruction is compatible. 

18. The method of claim 16, further comprising the step 

of: 

E. storing datum of the first operand data type in a first 
plurality of contiguous slots in the program data stack. 

19. The method of claim 16, further comprising the step 

of: 

F. storing datum of the second operand data type in a 
second plurality of contiguous slots in the program data 
stack. 

20. The method of claim 16, wherein step C comprises the 
steps of: 

C.l allocating on a first substack a number of slots equal 

to the first number; and 
C.2 allocating on a second substack a number of slots 

equal to the second number. 

21. A method for garbage collection for use in a computer 
system having a processor and a memory and a program 
including a plurality of instructions in the memory, the 
method comprising the steps of: 

A. identifying indiscriminate program instructions in a 
sequence of instructions, which indiscriminate program 
instructions operate with both primitive data type and 
reference type operands; 

B. replacing each indiscriminate program instruction with 
at least one discriminate program instruction, which 
discriminate program instruction operates with one of 
primitive data or reference type operands; 

C. storing operand data associated with a discriminating 
instruction that operates with the primitive data type 
operands in a first program data stack; 

D. storing operand data associated with a discriminating 
instruction that operates with the reference type oper- 
ands in a second program data stack; and 

E. using the reference type operands in the second pro- 
gram data stack to locate unused memory locations so 
that the unused memory locations can be reclaimed. 

22. The method of claim 21, wherein the first program 
stack base and the second program stack base define a linear, 
contiguous region of the memory. 
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